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Please REWRITE claims 1-2, 5-8. 11-13, 25, 27-^28, 37. 39-40, 48, 50, 52-53, 55-56, 
59--61, 64-66, and 69, For the Examiner's convenience, ihis Amendmeni includes the tex; of all 
claims under examination, a parenthetical expression for each claim lo indicate the status of the 
claim, and markings to show changes relative lo the immediate prior version of each curremly 
amended claim. 



(Currently Amended) A system for estimating Ae prevalence of digital content on a 
newortc comprising: 

an estimating device that receives [[stores]] Traffic data collected finom the network; 

an anonymizing device that locates user identification data in the ^r^c data, masks the user 

identification data to produce clean traffic data^ and stores The clean traffic data: 
a sampling device thai stores smnmarizarion data suirnnariz e d - gafee - dsta that describe each 

occiurence of the digital content in the clean traffic data; and 
an accessing device that presents the clean traffic data and the summarization data 

flummaiiaod traffic data to a user. 




2. (Currently Amended) The system of claim 1 , whemn the estimating device receives 
rotriov es the traffic data from at least one proxy cache server. 



3, (Previously Presented) The system of claim 1, wherein the sampling device computes the 
number of impressions of the digital content for a web site on the network. 
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4- (Previously Presented) The system of claim 1 , wherein the sampling device includes: 
a prober that fetches a web page from the network; 

an extractor that locates a fragment of the web page that includes the digital content; and 
a classifier that perfonns a structural analysis of the fragment to classify the digital content 

(Currently Amended) The system of claim I, wherein the accessing device generates a 
report when the clean traffic data or the summarization data sununqriz e d tK^ffic daia satisfy 
at least one crixerion, 

6, (Currently Amended) A method of estmrating flig prevalence of digital content on a 
network, comprising the steps of: 

estimating the global traffic to [[a]] at least one Web site on the network xo provide traffic 
data; 

locating user identificauon data in the traffic data; 
masking the user identification daia to produce clean traffic data: 

statistically sampling the conienis of said [[said]] at least one Web site to provide sampling 
data; 

storing the clean traffic data and the sampling data; and 
accessing the clean traffic data and the sampling data to generate a report. 

7, (Currently Amended) A system for estimating the prevalence of digital content on a network 
connected to at least one network site that includes at least one nerwork server to access at 
least one imifonn resource locator, the system comprising: 
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a database; 

a traffic analysis system that receives traffic data firom a traffic sampUng system, locates 
user identification data in the traffic data, masks the user identification data to 
produce clean traffic data, and stores the clean traffic data fiom a traffic oompling 
system in the database, the traffic data including said at least one unifonn resource 
locator. 

a digital content sampling system that stores the digital content at said at least one unifonn 

resource locator in the database; and 
a statistical summarization system that stores summarization data that describe the digital 

content in the database. 

8, (Currently Amended) The system of claim 7, further comprising: 

a Web firont end thai connects to the network and the database, wherein a client oliont uses a 
browser to connect to the Web from end. 

9, (Previously Presented) The system of claim 7» fimher comprising: 

a user interface thai an accoimt manager, an operator, or* a media editor can use to administer 
the system. 

1 0, (Original) The sj^tem of claim 7, wherein the network is the Internet, and wherein the 
network site is a Web site, 

I I . (Currently Amended) The system of claim 7, wherein to mask the user identification data in 
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the traffic data the xrafGc analysis system replaces the user idgntificaiiQn data with a result 
fiiom processing the user identification data through a crvptQeyaphicallv secure one-wav 
hash fiincrion, further oompriGoa: 

an anonymity systom that roooivoo iho tmffio data s ampl e from th e traffic sampling systcan 

and produces a clean traffio data somplo; ond 
a traffic summarization system that produc e s a sununorization of tho - dean tmffio data 
sampl e and stor e s th e traffic dam sampl e in th e dotabos e . 

12. (Currently Amended) The system of claim \ 1 , wherein the pser jdentjfjcanQyit ^am m4udes 
anoR>TOity s y st wi produoee a oletm traffio data sampl e by remov ^ig a network address or a 
cookie from tho traffio data som ^. 



13* (Currently Amended) The system of claim 1 1 , wherein the summarization of the clean 
traffic data sampl e includes a reference to said at least one uniform resource locator and a 
count [[tally]] of the number of requests for [[times]] said at least one uniform resource 
locator was r e qu e st e d . 

14. (Previously Presented) The system of claim 7, wherein the digital content sampling system 
furxber comprises; 

a probe mapping system that uses the summarization data to create a probe for the 

network, the probe map including a mapping for said at least one uniform resource 
locator; 

a uniform resource locator retrieval system that retrieves said at least one unifonn resource 
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locator from the networtc server, 
a browser emulation environment that conducts a simulation of the display of said at least 

one uniform resource locator in a browser; 
a digital content extractor thai stores the digital content from said at least one uniform 

resource locator in the database; and 
a smictural classifier that stores at least one classification type for the digital content in the 




1 5. (Previously Presented) The system of claim 14, wherein the probe map further comprises: 
a probability of the likelihood that said at least one unifomi resource location will be 

sampled; and 

a scale that detemiines the contribution of said at least one unifomi resource location to the 
suinmari2:ation data, 

16. (Original) The system of claim 14, wherein the simulation includes executing a program 
embedded in said at least one uniform resource locator. 

1 7. (Previously Presented) The system of claim 1 6, wherein the program is a JavaScript script, a 
Java applet, a Perl script, or a common gateway interface program. 

1 8. (Original) The system of claim 14. wherein the simulation includes executing dynamic 
digital content in said at least one unifomi resouTx:e locator 
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19, (Previously Presented) The system of claim 18, wherein the dynamic content is an interlaced 
GIF image, an MPEG movie, or an MPS audio file. 



20. (Previously Presented) The system of claim 14, wherein the digital content extractor 
retrieves the digital content from said at least one uniform resource locator by applying a 
rule set defined by a media editor. 

2 1 . (Previously Presented) The system of claim 14> wherein the digital content extractor 
retrieves the digital content from said at least one imiform resource locator by using an 
automated digital content detection system. 

22. (Previously Presented) The system of claim 21, wherein the automatic digital detection 
system comprises: 

a strucmral detector that locates an XML structure; and 

a feature detector that locates an XML feature within the XML smicuire, 

23. (Previously Presented) The system of claim 14> wherein the structural classifier determines 
said at least one classification type for the digital content. 

24. (Previously Presented) The system of claim 7, wherein the user interface fiinher comprises: 
a system.accouni management interface that assists the account manager with creating and 

modifying an account on the system; 
a siie administration interface that assists the operator with the administration of said at least 
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one nctwoilc site; 

a taxonomy administration interface that assists the media editor with the administration of 
the taxonomy data; 

a digital content classi^cation interface that assists the media editor with the classification of 

the digital content; and 
a rate card collection interface that assists the media editor with the administration of the 

rate card data. 

25 . (Currently Amended) A system for estimating prevalence of digital content on a network, 
comprising: 

a memory device; and 

a processor disposed in communication with the memory device, the processor configured 
to: 

obtain traffic data from at least one Web site on the network; 
locate user identification data in the traflBc data; 
mask the user identification data to produce clean traflBc data: 
compute a nimiber of in^ressions for the digital content in the clean traffic data; 
retrieve the digital content from the clean traffic data to generate sampling data; and 
generate prevalence estimates for the digital content from the clean traffic data and 
the sampling data, 

26. (Previously Presented) The system of claim 25, wherein the processor is further configured 
to: 

I -11 1 Amendment - Page 8 of 25 

48694 vl 



PAGE 12I29'RCVDAT3/4I2004 7:11:49PM [Eastern Standard Time]'SVR:USPTO-EPXV^^ 



y 

03-04-2004 0r:18pin From-MORGANiFINNEGAN 2028573730 T-078 P. 013/029 F-BB5 

Applicaijon Scnal Number 09/695^16 Attorney Docket Number: 4421^001 

1. 1 11 Amendmem daied March 4, 2004 
Reply to OCBcc: Actiou Unred November 4, 2003 

retrieve a Web page fiom said at Jeasi one Web siie: 
extract a firagmeni from the Web page; and 
classify the fragment. 

27. (Currently Amended) The system of claim 25, wherein to mask die user ideniification data 
in the traffic data the processor is further configured to: 

replace the user identification data with a result from processing the user identifjcation data 
through a crvptopraphicaJly secure one-way hash ftmction gonorato the traffio dam 
by r e tri e ving anonymQua traffic data . 

28. (Currently Amended) The system of claim 27, wherein the user identificatjon includes a 
neiwork address or a cookie, procefi s or is further oonfigur e d to: 

rotriovQ onojaymouQ dota sompIoG by romovine data from ifao tmffio data that identifi es a user 
on tho network. 

29. (Previously Presented) The system of claim 25, wherein the processor is further configured 
to: 

classify a fragment within the sampling data. 

30. (Previously Presented) The system of claim 29, wherein die processor is further configurtrd 
to: 

classify the fragment by analyzing the fragment for uniqueness and adding information to a 
database regarding the uniqueness of the fragment. 
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3 1 . (Previously Presented) The system of claim 30, wherein the processor is configured to; 
classify the fragment by detecting a duplicate fragment, 

(Previously Presented) The system of claim 25, wherein the processor is further configured 
to: 

interact wilh a user interface thai administers the system. 

(Previously Presented) The system of claim 25, wherein the processor i$ further configured 
to: 

include unlfonn resource locator information regarding said at least one Web site in the 
traffic data. 

34. (Previously Presented) The system of claim 25, wherein the processor is further configured 
lo: 

perfomi data integrity monitoring of the sampling data. 

35. (Previously Presented) The system of claim 25, wherein the processor is further configured 
to: 

serve as an automatic digital content detection system. 

36. (Previously Presented) The system of claim 35, wherein the automatic advertisement 
detection system applies at least one heuristic algorithm to detect digital content within an 
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HTML or an XML documetit and normalizes the detected HTML or XML content into a 



hierarchical fonn. 



37, (Currently Amended) A method for using a computer lo estimate die prevalence of digital 
content on a network, comprising the steps of: 

obtaining traffic data from at least one Web site on the nerwoik; 

locating user identjficanow dat^ m th^ tragic ^ata; 

masking the user identification data to prxaduce clean traffic data: 

computing a number of impressions for the digital content in the clean traffic data frmnsaid 
at l e ast on e W e b sit e; 

retrieving ib« digital content fiom the clean traffic data to generate sampling data; and 
generating prevalence estimates for the digital content from the clean traffic data and the 
sampling data- 

38, (Previously Presented) The method of claim 37, wherein retrieving the digital content 
further comprises the steps of: 

retrieving a Web page from said at least one Web site; 
extracting a fragment from the Web page; and 
classifying tiie fragment- 

39, (Currently Amended) The method of claim 37, wherein the masking of the user 
identification 4ata in the traffic data ftmher comprises: tho rroffio data is ononvmouG, 
replacing ihe user identification daia with a result from processing the user identification 
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data through a crypio graphically secure one- way hash function. 

40. (Currently AmencJed) The method of claim 39, wherein the user identification includes a 
network address or a coQkie. traffic ^a 4s- made - arKmvna&ua bv romovinQ data ftom tho 
traffio data that idontifios a uQor oo tho notw i wlc. 

41. (Pi^viously Presented) The method ofclaim 37, fimher comprising^ 
I classing a fragment within the sampUng data. 

42. (Previously Presented) The method of claim 41, wherein classifying The fiagmem further 
comprises the steps of: 

analyzing fragment for uniqueness; and 

adding information lo a database regarding the uniqueness of the fragment. 

43. (Previously Presented) The method of claim 42. fimher comprising the step of: 
classifying the fragment by detecting a duplicate fragment 

44. (Previously Presented) The method of claim 37, further comprising the step of: 
mteracting with a user interface that atlmimsters the system. 

45. (Previously Presented) The method of claim 37, further comprising the step of: 
including uniform resource locator information regarding said at least one Web site in the 

traffic data. 
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46. (Previously Presented) The method of claim 37, fimher comprising the step of: 
perfomndng data integrity raouitoring of the sampling data. 

47. (Previously Presented) The method of claim 37, further comprising the steps of: 
performing automatic advertisement detection by applying at least one heuristic algorithm lo 

detect advertising within an HTML or an XML docwnent; and 
nonnali2ing the detected HTML or XML content into a hierarchical form. 

4S. (Currently Amended) A computer readable medium comprising: 

code for obtaining traffic data from at least one Web site on the network; 
code for loc at i ng usei^ identification data th^ traffic data; 
code for masking the user identification data to produce clean traffic data; 
code for computing a number of impressions of digital content in the clean traffic data; 
code for retrieving the digital content from the clean traffic data to generate sampling data; 
and 

code for generating prevalence estimates for the digital conteiu from the clean traffic data 
and the sampling data. 

49. (Previously Presented) The computer readable medium of claim 48, fimher comprising: 
code for retrieving a Web page firom said at least one Web site; 
code for extracting a fragment from the Web page; and 
code to classify the fragment. 
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50. (Currently Amended) A system for estimating prevalence of digital content on a network, 
comprising: 

means for obtaining traffic data from at least one Web site on the network; 
means for locating user identification data in the traffic data; 
TTjcans for piaskjps tt^e ideTipficatjon data to pyodtjce cleai^ p^ffic d^tit; 
means for computing a nwnber of impressions for the digital content in the clean traffic 
data; 

means for retrieving the digital content from the clean traffic data to generate sampling data; 
and 

means for generating prevalence esnmaies of the digital content from the clean iraflfic data 
and the sampling data. 

5 1 . (Previously Presented) The system of claim 50, fimher comprising; 
means for classifying a fragment extracted from a Web page, 

52. (Currently Amended) The system of claim 50, further comprising; 

means for replacing the user identification ^ata widi a result from processing the user 
identification data through a crvptographically secure one-way hash function 
anonymiging the traffro data . 

53. (Currently Amended) A system of estimating prevalence of digital content on a network, 
comprising: 
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means for esiimanng global tiafBc to ai least one Web site on the network to provide traffic 
data; 

means for locating user identification data in the traffic data; 
means for masking the user identification data to produce clean traffic data; 
means for statistically sampling the contents of said at least one Web site to provide 
sampling data; 

means for storing the clean traffic data and the sampling daia; and 
means for generating prevalence estimates for the digital contem by accessing the clean 
traffic data and the sampling data. 

54, (Previously Presented) The system of claim 53, further comprising: 
means for reporting the prevalence estimates to a user. 



55, (Currently Amended) A method &r using a computer to estimate the prevalence of digital 
content on a nerwork, comprising the st^s of: 
receiving traffic data from the nerwork: 
jocating u^er identification data in ^e traffic data; 
masking the user identiftcatjon d^n^ to produce dean traffic data: 
storing the clean traffic data ooU e ot e d from iho network ; 

storing suTmnarizarion data Qununorigod traffic data that describe each occurrence of the 

digital content in the cleaQ traffic data; and 
presenting the clean traffic data and the summarization data Qummorizod troffio data to a 

user. 
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56. (Currently Amended) The method of claim 55, wherem the receiving of the traffic data is 
from at least one proxy servac aioiiQg traffiQ dota fUrthor oompriGOfi the stop of: 
r e tri e ving ih t» m£&o dmo txom ot toast one proxy s e rv e r 



57. (Previously Presented) The method of claim 55, wherein storing summarized rrafBc data 
further comprises the step of: 




computing the number of impressions of the digital content for a web site on the netwoilc. 



58. (Previously Presented) The method of claim 55, wherein storing traffic data further 
comprises the steps of: 

fetching a web page from the network; 

locating a fragment of the web page that includes the digital content; and 
performing a structural analysis of the fragment to classify the digital content 

59, (Currently Amended) The method of claim 55, wherein presenting the clean traffic data and 
the summarization data summariz e d trafiio data further comprises the step of: 
generating areport when the clean traffic data or the summarization data summariz e d traffic 

data satisfy at least one criterion. 



60. (Currently Amended) A system for estimating prevalence of digital content on a network, 
comprising: 
a memory device; and 
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a processor disposed in coxiununicauon with the memoiy device, the processor coofigured 
to: 

receive traffic data finom the network: 

locate user identification data in the traffic data: 

mask the user identification data to pmduce clean traffic data; 

store the clean traffic data ooUootod from tho network ; 

store summarization data sumroaruod troffic data that describe each occurrence of 

the digital content in the clean traffic data; and 
present the clean traffic data and the summarization data summariz e d traffic data to a 
user 

6 1 . (Currently Amended) The system of claim 60, wherein the receivin|g of the traffic data is 

from at least one proxy server, tho prowGPor rotriovoo the traffic data fi^m at least on e proxy 
oooho Qorvor. 



62. (Previously Presented) The system of claim 60. wherein the processor computes the number 
of impressions of the digital content for a web site on the network, 

63. (Previously Presented) The system of claim 60. wherein the processor is fimher configured 
to; 

fetch a web page from the network; 

locate a fiagmeni of the web page that includes the digital content; and 
perform a scruciural analysis of the fragment to classify the digital content. 
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64. (Currently Amended) The system of claim 60, wherein the processor generates a report 
when the clean traffic data or the summariyaiion data fsummorizod traffic data satisfy at least 
one criterion, 

65. (Cmiently Amended) A computer readable medium comprising: 
code for receiving traffic data from the network: 

code for locating user identification data in the traffic da;a; 

code for masking the user identifjcation data to produce clean trafSc data: 

code for storing the clean traffic data ooH e ct e d from th e n e twork : 

code for stoning sanimaii2ation data s ummariz e d traffic da ta that describe each occurrence 

of the digital content in the clean traffic data; and 
code for presenting the clean traffic data and the suinmari2aTiQndata summori^od troffio 

^sm to a user. 

66. (Currently Amended) The computer readable medium of claim 65, the receiving of the 
traffic data is &om at least one proxy server, further oompriGinfi: 

cod e for r e tri e ving th e mSfio data from at looat obq - proxy cache server. 

67. (Previously Presented) The computer readable medium of claim 65, further comprising: 
code for computing the number of impressions of the digital content for a web site on the 

network. 
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68. (Previously Presenied) The computer readable medium of claim 65, further con^>rising: 
code for fetching a web page from the network; 

code for locating a fragment of the web page that includes the digital content; and 
code for perfomiing a structural analysis of the fragment to classify the digital content. 



69. (Currently Amended) The computer readable medium of claim 65, further comprising: 
code for generating a report when the clean traffic data or the summarizatjon data 
summarizod traffio data satisfy at least one criterion. 
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